Goto

Collaborating Authors

 Niobrara County


What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Li, Ming, Li, Yanhong, Zhou, Tianyi

arXiv.org Artificial Intelligence

What makes a difference in the post-training of LLMs? We investigate the training patterns of different layers in large language models (LLMs), through the lens of gradient, when training with different responses and initial models. We are specifically interested in how fast vs. slow thinking affects the layer-wise gradients, given the recent popularity of training LLMs on reasoning paths such as chain-of-thoughts (CoT) and process rewards. In our study, fast thinking without CoT leads to larger gradients and larger differences of gradients across layers than slow thinking (Detailed CoT), indicating the learning stability brought by the latter. Moreover, pre-trained LLMs are less affected by the instability of fast thinking than instruction-tuned LLMs. Additionally, we study whether the gradient patterns can reflect the correctness of responses when training different LLMs using slow vs. fast thinking paths. The results show that the gradients of slow thinking can distinguish correct and irrelevant reasoning paths. As a comparison, we conduct similar gradient analyses on non-reasoning knowledge learning tasks, on which, however, trivially increasing the response length does not lead to similar behaviors of slow thinking. Our study strengthens fundamental understandings of LLM training and sheds novel insights on its efficiency and stability, which pave the way towards building a generalizable System-2 agent. Our code, data, and gradient statistics can be found in: https://github.com/MingLiiii/Layer_Gradient.


Establishing Nationwide Power System Vulnerability Index across US Counties Using Interpretable Machine Learning

Ma, Junwei, Li, Bo, Omitaomu, Olufemi A., Mostafavi, Ali

arXiv.org Artificial Intelligence

Power outages have become increasingly frequent, intense, and prolonged in the US due to climate change, aging electrical grids, and rising energy demand. However, largely due to the absence of granular spatiotemporal outage data, we lack data-driven evidence and analytics-based metrics to quantify power system vulnerability. This limitation has hindered the ability to effectively evaluate and address vulnerability to power outages in US communities. Here, we collected ~179 million power outage records at 15-minute intervals across 3022 US contiguous counties (96.15% of the area) from 2014 to 2023. We developed a power system vulnerability assessment framework based on three dimensions (intensity, frequency, and duration) and applied interpretable machine learning models (XGBoost and SHAP) to compute Power System Vulnerability Index (PSVI) at the county level. Our analysis reveals a consistent increase in power system vulnerability over the past decade. We identified 318 counties across 45 states as hotspots for high power system vulnerability, particularly in the West Coast (California and Washington), the East Coast (Florida and the Northeast area), the Great Lakes megalopolis (Chicago-Detroit metropolitan areas), and the Gulf of Mexico (Texas). Heterogeneity analysis indicates that urban counties, counties with interconnected grids, and states with high solar generation exhibit significantly higher vulnerability. Our results highlight the significance of the proposed PSVI for evaluating the vulnerability of communities to power outages. The findings underscore the widespread and pervasive impact of power outages across the country and offer crucial insights to support infrastructure operators, policymakers, and emergency managers in formulating policies and programs aimed at enhancing the resilience of the US power infrastructure.


DATAMETREX APPOINTS NEW PRESIDENT OF NEXALOGYDatametrex

#artificialintelligence

Toronto, Canada, November 2, 2022 – Datametrex AI Limited (the "Company" or "Datametrex'') (TSXV: DM) (FSE: D4G) (OTCQB: DTMXF) is pleased to welcome Mr. Charles Park, CFA, as the President of Nexalogy. Charles brings a breadth of knowledge cultivated from over a decade in the banking, technology, and online gaming industries. An engineer at heart, Mr. Park obtained his Bachelor of Science at the Korean Advanced Institute of Science and Technology (KAIST). He went on to attend the accelerated International Graduate Program at the Standard Chartered Bank where he began as an FX Trader and transitioned to Corporate Lending, where he led several substantial Commercial Real Estate (CRE) deals. It was during this time he became an accredited Charted Financial Analyst (CFA). In the last five (5) years, Charles has taken on significant operation roles at the upstart NSUS Group, helping it grow from a startup to a major player in the Online Gaming space. "Datametrex is focused on expanding Nexalogy.

  Country:
  Genre: Press Release (1.00)
  Industry: Banking & Finance > Real Estate (0.56)